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FOCUSED LIBRARIES OF GENETIC PACKAGES 



12 This application claims the benefit under 35 USC 

13 § 120 of United States provisional application 60/256, 380, 

III 

in 

\U 5 the Tables attached to it are specifically incorporated by 



filed December 18, 2001. The provisional application and 



reference herein. 

The present invention relates to focused 
hi libraries of genetic packages that each display, display 

and express, or comprise a member of a diverse family of 
10 peptides, polypeptides or proteins and collectively 
display, display and express, or comprise at least a 
portion of the focused diversity of the family. The 
focused diversity of the libraries of this invention 
comprises both sequence diversity and length diversity. In 
15 a preferred embodiment, the focused diversity of the 

libraries of this invention is biased toward the natural 
diversity of the selected family. In a more preferred 
embodiment, the libraries are biased toward the natural 
diversity of human antibodies and are characterized by 
20 variegation in their heavy chain and light chain 
complementarity determining regions ("CDRs"). 

The present invention further relates to vectors 
and genetic packages (e.g., cells, spores or viruses) for 
displaying, or displaying and expressing a focused diverse 
25 family of peptides, polypeptides or proteins. In a 
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preferred embodiment the genetic packages are filamentous 
phage or phagemids or yeast. Again, the focused diversity 
of the family comprises diversity in sequence and diversity 
in length. 

5 The present invention further relates to methods 

of screening the focused libraries of the invention and to 
the peptides, polypeptides and proteins identified by such 
screening. 

BACKGROUND OF THE INVENTION 

10 It is now common practice in the art to prepare 



Iy libraries of genetic packages that individually display, 

in 

display and express, or comprise a member of a diverse 
family of peptides, polypeptides or proteins and 
collectively display, display and express, or comprise at 
Z 15 least a portion of the amino acid diversity of the family. 

^ In many common libraries, the peptides, polypeptides or 

0 

~ proteins are related to antibodies (e.g., single chain Fv 

£ (scFv) , Fv, Fab, whole antibodies or minibodies (i.e., 

dimers that consist of V H linked to V L ) ) . Often, they 

20 comprise one or more of the CDRs and framework regions of 
the heavy and light chains of human antibodies. 

Peptide, polypeptide or protein libraries have 
been produced in several ways in the prior art. See e.g., 
Knappik et al., J. Mol. Biol., 296, pp. 57-86 (2000), which 

25 is incorporated herein by references. One method is to 
capture the diversity of native donors, either naive or 
immunized. Another way is to generate libraries having 
synthetic diversity. A third method is a combination of 
the first two. Typically, the diversity produced by these 

30 methods is limited to sequence diversity, i.e., each member 
of the library differs from the other members of the family 
by having different amino acids or variegation at a given 
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position in the peptide, polypeptide or protein chain. 
Naturally diverse peptides, polypeptides or proteins, 
however, are not limited to diversity only in their amino 
acid sequences. For example, human antibodies are not 
5 limited to sequence diversity in their amino acids, they 
are also diverse in the lengths of their amino acid chains. 

For antibodies, diversity in length occurs, for 
example, during variable region rearrangements. See e.g., 
Corbett et al . , J. Mol. Biol., 270, pp. 587-97 (1997). The 
M 10 joining of V genes to J genes, for example, results in the 

£ _ 1 

*5 inclusion of a recognizable D segment in CDR3 in about half 

l« of the heavy chain antibody sequences, thus creating 

CP 

k n regions encoding varying lengths of amino acids. The 

jjjj following also may occur during joining of antibody gene 

s 15 segments: (i) the end of the V gene may have zero to 

17, several bases deleted or changed; (ii) the end of the D 

lU 

| a segment may have zero to many bases removed or changed; 

ig 

J- (iii) a number of random bases may be inserted between V 

!« and D or between D and J; and (iv) the 5* end of J may be 

20 edited to remove or to change several bases. These 

rearrangements result in antibodies that are diverse both 
in amino acid sequence and in length. 

Libraries that contain only amino acid sequence 
diversity are, thus, disadvantaged in that they do not 

25 reflect the natural diversity of the peptide, polypeptide 
or protein that the library is intended to mimic. Further, 
diversity in length may be important to the ultimate 
functioning of the protein, peptide or polypeptide. For 
example, with regard to a library comprising antibody 

30 regions, many of the peptides, polypeptides, proteins 
displayed, displayed and expressed, or comprised by the 
genetic packages of the library may not fold properly or 
their binding to an antigen may be disadvantaged, if 
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diversity both in sequence and length are not represented 
in the library. 

An additional disadvantage of prior art libraries 
of genetic packages that display, display and express, or 
5 comprise peptides, polypeptides and proteins is that they 
are not focused on those members that are based on natural 
occurring diversity and thus on members that are most 
likely to be functional. Rather, the prior art libraries, 
typically, attempt to include as much diversity or 
U 10 variegation at every amino acid residue as possible. This 

makes library construction time-consuming and less 

fU efficient than possible. The large number of members that 

ifi 

l~ are produced by trying to capture complete diversity also 

!U makes screening more cumbersome than it needs to be. This 

* ff ' 15 is particularly true given that many members of the library 
* A will not be functional. 



SUMMARY OF THE INVENTION 

One objective of this invention is focused 
libraries of vectors or genetic packages that encode 

20 members of a diverse family of peptides, polypeptides or 
proteins wherein the libraries encode populations that are 
diverse in both length and sequence. The diverse length 
comprising components that contain motifs that are likely 
to fold and function in the context of the parental 

25 peptide, polypeptide or protein. 

Another object of this invention is focused 
libraries of genetic packages that display, display and 
express, or comprise a member of a diverse family of 
peptides, polypeptides and proteins and collectively 

30 display, display and express, or comprise at least a 
portion of the focused diversity of the family. These 
libraries are diverse not only in their amino acid 
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sequences, but also in their lengths. And, their diversity 
is focused so as to more closely mimic or take into account 
the naturally-occurring diversity of the specific family 
that the library represents. 
5 Another object of this invention is diverse, but 

focused, populations of DNA sequences encoding peptides, 
polypeptides or proteins suitable for display or display 
and expression using genetic packages (such as phage or 
phagemids) or other regimens that allow selection of 

|* 10 specific binding components of a library. 

I J 

q A further object of this invention is focused 

libraries comprising the CDRs of human antibodies that are 
diverse in both their amino acid sequence and in their 
length (examples of such libraries include libraries of 
15 single chain Fv (scFv) , Fv, Fab, whole antibodies or 

5 s £ minibodies (i.e., dimers that consist of V H linked to V L ) ) . 

iy 

I «* Such regions may be from the heavy or light chains or both 

and may include one or more of the CDRs of those chains. 
More preferably, the diversity or variegation occurs in all 
20 of the heavy chain and light chain CDRs. 

It is another object of this invention to provide 
methods of making and screening the above libraries and the 
peptides, polypeptides and proteins obtained in such 
screening. 

25 Among the preferred embodiments of this invention 

are the following: 

1. A focused library of vectors or genetic 
packages that display, display and express, or comprise a 
member of a diverse family of human antibody related 
30 peptides, polypeptides and proteins and collectively 
display, display and express, or comprise at least a 
portion of the diversity of the antibody family, the 
vectors or genetic packages being characterized by 
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variegated DNA sequences that encode a heavy chain CDR1 
selected from the group consisting of: 

(1) <1> 1 Y 2 <1> 3 M 4 <1> 5 , wherein <1> is an 
equimolar mixture of each of amino acid residues A, D, E, 

5 F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W, and Y; 

(2) (S/T) 1 (S/G/X) 2 (S/G/X) 3 Y 4 Y 5 W 6 (S/G/X) 7 . 
wherein (S/T) is a 1:1 mixture of S and T residues, (S/G/X) 
is a mixture of 0.2025 S, 0.2025 G and 0.035 of each of 
amino acid residues A, D, E, F, H, I, K, L, M, N, P, Q, R, 

I* 10 T, V, W, and Y; 

U (3) V 1 S 2 G 3 G 4 S 5 I 6 S 7 <1> 8 <1> 9 <1> 10 Y 11 Y 12 W 13 <1> 14/ 

wherein <1> is an equimolar mixture of each of amino acid 
Jg residues A, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, 

lz W, and Y; and 

U i 

9 " 15 (4) mixtures of vectors or genetic packages 

characterized by any of the above DNA sequences , preferably 

ass 

jl in the ratio: HC CDRls (1) : (2) : (3) : : 0 . 80 : 0 . 17 : 0 . 02 . 
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2. A focused library of vectors or genetic 
packages that display , display and express, or comprise a 

20 member of a diverse family of human antibody related 
peptides, polypeptides and proteins and collectively 
display, display and express, or comprise at least a 
portion of the diversity of the antibody facility, the 
vectors or genetic packages being characterized by 

25 variegated DNA sequences that encode a heavy chain CDR2 
selected from the group consisting of: 

(1) <2>K2><3>SGG<1>T<1>YADSVKG, wherein 
<1> is an equimolar mixture of each of amino acid residues 
A, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W, and 

30 Y; <2> is an equimolar mixture of each of amino acid 

residues Y, R, W, V, G, and S; and <3> is an equimolar 
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mixture of each of amino acid residues P, S, and G or an 
equimolar mixture of P and S; 

(2 ) <1>I<4X1><1><G><5><1><1><1>YADSVKG, 
wherein <1> is an equimolar mixture of each of amino acid 

5 residues A, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, 
W, and Y; <4> is an equimolar mixture of residues D, I, N, 
S, W, Y; and <5> is an equimolar mixture of residues S, G, 
D and N; 

(3) <l>I<4Xlxl>G<5xlxl>YNPSLKG, wherein 
10 <1> is an equimolar mixture of each of amino acid residues 

A, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W and Y; 
and <4> and <5> are as defined above; 

n 

g (4) <1>K8>S<1X1X1>GGYY<1>YAASVKG, 

z wherein <1> is an equimolar mixture of each amino acid 

n 

15 residues A, D, E, F, G, H, I, K, L, M, N f P, Q, R, S, T, V, 

W and Y; <8> is 0.27 R and 0.027 of each of 
ADE FGHI KLMN PQSTVWY ; and 

(5) mixtures of vectors or genetic packages 
characterized by any of the above DNA sequences, preferably 
20 in the ratio: HC CDR2s: (l)/(2) (equimolar): 
(3) : (4) : :0. 54:0. 43:0. 03. 



3. A focused library of vectors or genetic 
packages that display, display and express , or comprise a 
member of a diverse family of human antibody related 

25 peptides, polypeptides and proteins and collectively 
display, display and express, or comprise at least a 
portion of the diversity of the antibody family, the 
vectors or genetic packages being characterized by 
variegated DNA sequences that encode a heavy chain CDR3 

30 selected from the group consisting of: 

(1) YYCA2 1 1 1 1 YFDYWG , wherein 1 is an 
equimolar mixture of each amino acid residues A, D, E, F, 
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G, H, I, K, L, M, N, P, Q, R, S, T, V, W and Y; and 2 is an 
equimolar mixture of K and R; 

(2) YYCA2 1111 1 1 YFDYWG , wherein 1 is an 
equimolar mixture of each amino acid residues A, D, E, F, 

5 G, H, I, K, L, M, N, P, Q, R, S, T, V, W and Y; and 2 is an 
equimolar mixture of K and R; 

(3) YYCA2 111111 1 1 YFDAYTG , wherein 1 is an 
equimolar mixture of each amino acid residues A, D, E, F, 

G, H, I, K, L, M, N, P, Q, R, S, T, V, W and Y; and 2 is an 
I* 10 equimolar mixture of K and R; 

5S (4) Y YCAR1 11S2S311 1 YFDYWG , wherein 1 is an 

Sss? 

i"U equimolar mixture of each amino acid residues A, D, E, F, 

CO 

,n G, H, I, K, L, M, N, P, Q, R, S, T, V, W and Y; and 2 is an 

as s 

?y equimolar mixture of S and G; and 3 is an equimolar mixture 

J'"' 15 of Y and W; 

(5) YYCA2111CSG11CY1 YFDYWG, wherein 1 is an 
equimolar mixture of each amino acid residues A, D, E, F, 
G, H, I, K, L, M, N, P, Q, R, S f T, V, W and Y; and 2 is an 



a 

U equimolar mixture of K and R; 

20 (6) YYCA211S1TIFG11111YFDYWG, wherein 1 is 

an equimolar mixture of each amino acid residues A, D, E, 
F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W and Y; and 2 is 
an equimolar mixture of K and R; 

(7) YYCAR111YY2S334 4 111YFDYWG, wherein 1 is 
25 an equimolar mixture of each amino acid residues A, D, E, 

F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W and Y; 2 is an 
equimolar mixture of D and S; and 3 is an equimolar mixture 
of S and G; 

(8) YYCAR1111YC2231CY111YFDYWG, wherein 1 
30 is an equimolar mixture of each amino acid residues A, D, 

E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W and Y; 2 is 
an equimolar mixture of S and G; and 3 is an equimolar 
mixture of T, D and G; and 
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(9) mixtures of vectors or genetic packages 
characterized by any of the above DNA sequences, preferably 
the HC CDR3s (1) through (8) are in the following 
proportions in the mixture: 
5 (1) 0.10 

(2) 0,14 

(3) 0.25 

(4) 0.13 

(5) 0.13 

l-s. 10 (6) 0.11 

{3 (7) 0.04 and 

Jy (8) 0.10; and more preferably the HC CDR3s 

f {? (1) through (8) are in the following proportions in the 

IU mixture: 

f 15 (1) 0.02 

I* (2) 0.14 

10 (4) 0.14 

H (5) 0.14 

20 (6) 0.12 

(7) 0.08 and 

(8) 0.11. 

Preferably, 1 in one or all of HC CDR3s (1) 
through (8) is 0.095 of each of G and Y and 0.048 of each 
25 of A, D, E, F, H, I, K, L, M, N, P, Q, R, S, T, V, and W. 

4. A focused library of vectors or genetic 
packages that display, display and express, or comprise a 
member of a diverse family of human antibody related 
peptides, polypeptides and proteins and collectively 
30 display, display and express, or comprise at least a 
portion of the diversity of the antibody family, the 
vectors or genetic packages being characterized by 
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variegated DNA sequences that encodes a kappa light chain 
CDR1 selected from the group consisting of: 

( 1 ) RASQ<1>V<2X2><3>LA 

(2) RASQ<l>V<2x2x2X3>LA; 

5 wherein <1> is an equimolar mixture of amino acid residues 
ADE FGH I KLMN PQRS T VW Y ; <2> is 0.2 S and 0.044 of each of 
ADEFGHIKLMNPQRTVWY; and <3> is 0.2Y and 0.044 each of 
ADE FGH I KLMN PQRT VW and Y; and 

(3) mixtures of vectors or genetic packages 
\a 10 characterized by any of the above DNA sequences, preferably 

in the ratio CDRls (1) : (2) : : 0 . 68 : 0 . 32 . 



ru 

m 



5. A focused library of vectors or genetic 



packages that display, display and express, or comprise a 

in 

a " ' member of a diverse family of human antibody related 

** 15 peptides, polypeptides and proteins and collectively 

display, display and express, or comprise at least a 
^2 portion of the diversity of the antibody family, the 

jU vectors or genetic packages being characterized by 

variegated DNA sequences that encode a kappa light chain 
20 CDR2 having the sequence: 

<1>AS<2>R<4X1>, 
wherein <1> is an equimolar mixture of amino acid residues 
ADE FGHIKLMNPQRSTVWY; <2> is 0.2 S and 0.044 of each of 
ADEFGHIKLMNPQRTVWY; and <4> is 0.2 A and ) 0.044 each of 
25 DE FGH I KLMN PQRS T VW Y . 



6. A focused library of vectors or genetic 
packages that display, display and express, or comprise a 
member of a diverse family of human antibody related 
peptides, polypeptides and proteins and collectively 
30 display, display and express, or comprise at least a 
portion of the diversity of the antibody family, the 
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vectors or genetic packages being characterized by 
variegated DNA sequences that encode a kappa light chain 
CDR3 selected from the groups consisting of: 

(1) QQ<3Xl><l><l>P<l>T, 
5 wherein <1> is an equimolar mixture of amino acid residues 

ADEFGHIKLMNPQRSTVWY; <3> is 0.2 Y and 0.044 each of 
ADE FGH I KLMN PQRTVW ; 

(2) QQ33111P, wherein 1 and 3 are as 
defined in (1) above; 

10 (3) QQ3211PP1T, wherein 1 and 3 are as 

defined in (1) above and 2 is 0.2 S and 0.044 each of 
ADEFGHIKLMNPQRTVWY; and 
In (4) mixtures of vectors or genetic packages 

W characterized by any of the above DNA sequences, preferably 

l n 

15 in the ratio CDR3s (1) : (2) : (3) : : 0 . 65 : 0 . 1 : 0 . 25 . 

\1 7. A focused library of vectors or genetic 

packages that display, display and express, or comprise a 
member of a diverse family of human antibody related 
peptides, polypeptides and proteins and collectively 
20 display, display and express, or comprise at least a 
portion of the diversity of the antibody family, the 
vectors or genetic packages being characterized by 
variegated DNA sequences that encode a lambda light chain 
CDR1 selected from the group consisting of: 
2 5 (1) TG<1>SS<2>VG<1X3><2X3>VS, 

wherein <1> is 0.27 T, 0.27 G and 0.027 each of 
ADEFHIKLMNPQRSVWY, <2> is 0.27 D, 0.27 N and 0.027 each of 
AE FGH I KLMPQRS T VW Y , and <3> is 0.36 Y and 0.036 each of 
ADEFGHIKLMNPQRSTVW; 
30 (2) G<2X4>L<4X4X4X3X4X4>, 
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wherein <2> is as defined in (1) above and <4> is an 
equimolar mixture of amino acid residues 
ADE FGH I KLMN PQRS T VW Y ; and 

(3) mixtures of vectors or genetic packages 
5 characterized by any of the above DNA sequences, preferably 
in the ratio CDRls (1) : (2) : : 0 . 67 : 0 . 33 . 

8. A focused library of vectors or genetic 
packages that display, display and express, or comprise a 
member of a diverse family of human antibody related 
10 peptides, polypeptides and proteins and collectively 

U display, display and express, or comprise at least a 

0 

q portion of the diversity of the antibody family, the 

U vectors or genetic packages being characterized by 

n 

variegated DNA sequences that encode a lambda light chain 
15 CDR2 has the sequence: 

<4><4><4><2>RPS, 
wherein <2> is 0.27 D, 0.27 N, and 0.027 each of 
AEFGHIKLMPQRSTVWY and <4> is an equimolar mixture of amino 
acid residues ADEFGHIKLMNPQRSTVW. 

20 9. A focused library of vectors or genetic 

packages that display, display and express, or comprise a 
member of a diverse family of human antibody related 
peptides, polypeptides and proteins and collectively 
display, display and express, or comprise at least a 
25 portion of the diversity of the antibody family, the 
vectors or genetic packages being characterized by 
variegated DNA sequences that encode a lambda light chain 
CDR3 selected from the group consisting of: 

(1) <4><5><4><2><4>S<4X4><4><4>V, 
30 wherein <2> is 0.27 D, 0.27 N, and 0.027 each of 

AEFGHIKLMPQRSTVWY; <4> is an equimolar mixture of amino 
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acid residues ADEFGHIKLMNPQRSTVW; and <5> is 0.36 S and 
0.0355 each of ADE FGH I KLMN PQRT VWY ; 

(2) <5>SY<1X5>S<5X1X4>V, wherein <1> is 
an equimolar mixture of ADEFGHIKLMNPQRSTVWY; and <4> and 
<5> are as defined in (1) above; and 

(3) mixtures of vectors or genetic packages 
characterized by any of the above DNA sequences, preferably 
in the ratio CDR3s (1) : (2) : : 1 : 1 . 

10. A focused library comprising variegated DNA 
sequences that encode a heavy chain CDR selected from the 
group consisting of: 

(1) one or more of the heavy chain CDRls of 

paragraph 1 above; 

(2) one or more of the heavy chain CDR2s of 

paragraph 2 above; 

(3) one or more of the heavy chain CDR3s of 

paragraph 3 above; and 

(4) mixtures of vectors or genetic packages 

characterized by (1), (2) and (3). 

11. The focused library comprising one or more 
of the variegated DNA sequences that encodes a heavy chain 
CDR of paragraphs 1, 2 and 3 and further comprising 
variegated DNA sequences that encodes a light chain CDR 
selected from the group consisting of 

(1) one or more the kappa light chain CDRls 

of paragraph 4; 

(2) the kappa light chain CDR2 of 

paragraph 5; 

(3) one or more of the kappa light chain 
CDR3s of paragraph 6; 
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(4) one or more of the kappa light chain 

CDRls of paragraph 7; 

(5) the lambda light chain CDR2 of 

paragraph 8 ; 

(6) one or more of the lambda light chain 
CDR3s of paragraph 9; and 

(7) mixtures of vectors and genetic 
packages characterized by one or more of (1) through (6) . 

12. A population of variegated DNA sequences as 
described in paragraphs 1-11 above. 

13. A population of vectors comprising the 
variegated DNA sequences as described in paragraphs 1-11 
above . 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

Antibodies ("Ab") concentrate their diversity 
into those regions that are involved in determining 
affinity and specificity of the Ab for particular targets. 
These regions may be diverse in sequence or in length. 
Generally, they are diverse in both ways. However, within 
families of human antibodies the diversities, both in 
sequence and in length, are not truly random. Rather, some 
amino acid residues are preferred at certain positions of 
the CDRs and some CDR lengths are preferred. These 
preferred diversities account for the natural diversity of 
the antibody family. 

According to this invention, and as more fully 
described below, libraries of vectors and genetic packages 
that more closely mirror the natural diversity, both in 
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sequence and in length, of antibody families, or portions 
thereof are prepared and used. 

Human Antibody Heavy Chain Sequence and Length Diversity 

(a) Framework 

The heavy chain ("HC") Germ-Line Gene (GLG) 3-23 
(also known as VP-47) accounts for about 12% of all human 
Abs and is preferred as the framework in the preferred 
embodiment of the invention. It should, however, be 
understood that other well-known frameworks, such as 4-34, 
3-30, 3-30.3 and 4-30.1, may also be used without departing 
from the principles of the focused diversities of this 
invention. 

In addition, JH4 ( YFDYWGQGTLVTUSS ) occurs more 
often than JH3 in native antibodies. Hence, it is 
preferred for the focused libraries of this invention. 
However, JH3 (AFDIWGQGTMVTVSS) could as well be used. 

(b) Focused Length Diversity: CDR1 , 2 and 3 
(i) CDR1 

For CDRl, GLGs provide CDRls only of the lengths 
5, 6, and 7. Mutations during the maturation of the V- 
domain gene, however, can lead to CDRls having lengths as 
short as 2 and as long as 16. Nevertheless, length 5 
predominates. Accordingly, in the preferred embodiment of 
this invention, the preferred HC CDRl is 5 amino acids, 
with less preferred CDRls having lengths of 7 and 14. In 
the most preferred libraries of this invention, all three 
lengths are used in proportions similar to those found in 
natural antibodies . 



(ii) CDR2 

GLGs provide CDR2s only of the lengths 15-19, but 
mutations during maturation may result in CDR2s of lengths 
from 16 to 28 amino acids. The lengths 16 and 17 
predominate in mature Ab genes. Accordingly , length 17 is 
the preferred length for HC CDR2 of the present invention. 
Less preferred HC CDR2s of this invention have lengths 16 
and 19. In the most preferred focused libraries of this 
invention, all three lengths are included in proportions 
similar to those found in natural antibody families. 

(iii) CDR3 

HC CDR3s vary in length. About half of human HCs 
consist of the components: V: : nz : : D: : ny : : JHn where V is a V 
gene, nz is a series of bases (mean 12) that are 
essentially random, D is a D segment, often with heavy 
editing at both ends, ny is a series of bases (mean 6) that 
are essentially random, and JH is one of the six JH 
segments, often with heavy editing at the 5 f end. The D 
segments appear to provide spacer segments that allow 
folding of the IgG. The greatest diversity is at the 
junctions of V with D and of D with JH. 

In the preferred libraries of this invention both 
types of HC CDR3s are used. In HC CDR3s that have no 
identifiable D segment, the structure is V::nz::JHn where 
JH is usually edited at the 5 f end. In HC CDR3s that have 
an identifiable D segment, the structure is 
V: :nz : : D: : ny : : JHn. 
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(c) Focused Sequence Diversity: CDRl, 2 and 3 
(i) CDR1 

In 5 amino acid length CDRl, examination of a 3D 
model of a humanized Ab showed that the side groups of 
5 residues 1, 3, and 5 were directed toward the combining 
pocket. Consequently , in the focused libraries of this 
invention, each of these positions may be selected from any 
of the native amino acid residues, except cysteine ("C") . 
Cysteine can form disulfide bonds, which are an important 
i3 10 component of the canonical Ig fold. Having free thiol 

in groups could, thus, interfere with proper folding of the HC 



and could lead to problems in production or manipulation of 



m 

m selected Abs. Thus, in the focused libraries of this 



invention cysteine is excluded from positions 1, 3 and 5 of 

15 the preferred 5 amino acid CDRls. The other 19 natural 
amino acids residues may be used at positions 1, 3 and 5, 
Preferably, each is present in equimolar ratios in the 
variegated libraries of this invention. 

3D modeling also suggests that the side groups of 

20 residue 2 in a 5 amino acid CDRl are directed away from the 
combining pocket. Although this position shows substantial 
diversity, both in GLG and mature genes, in the focused 
libraries of this invention this residue is preferably Tyr 
(Y) because it occurs in 681/820 mature antibody genes. 

25 However, any of the other native amino acid residues, 
except Cys (C) , could also be used at this position. 

For position 4, there is also some diversity in 
GLG and mature antibody genes. However, almost all mature 
genes have uncharged hydrophobic amino acid residues: A, G, 

30 L, P, F, M, W, I, V, at this position. Inspection of a 3D 
model also shows that the side group of residue 4 is packed 
into the innards of the HC. Thus, in the preferred 
embodiment of this invention which uses framework 3-23, 
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residue 4 is preferably Met because it is likely to fit 
very well into the framework of 3-23. With other 
frameworks, a similar fit consideration is used to assign 
residue 4. 

5 Thus, the most preferred HC CDRl of this 

invention consists of the amino acid sequence <1>Y<1>M<1> 
where <1> can be any one of amino acid residues: A, D, E, 

F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W, Y (not C) , 
preferably present at each position in an equimolar amount. 

IM 10 This diversity is shown in the context of a framework 3- 

I;; 23:JH4 in Table 1. It has a diversity of 6859-fold. 

The two less preferred HC CDRls of this invention 
have length 7 and length 14. For length 7, a preferred 
jy variegation is (S/T) 1 (S/G/<1>) 2 (S/G/<1>) 3 Y 4 Y 5 W 6 (S/G/<1>) 7 ; 

15 where (S/T) indicates an equimolar mixture of Ser and Thr 
codons; (S/G/<1>) indicates a mixture of 0.2025 S, 0.2025 

G, and 0.035 for each of A, D, E, F, H, I, K, L, M, N, P, 
Q, R, T, V, W, Y. This design gives a predominance of Ser 
and Gly at positions 2, 3, and 7, as occurs in mature HC 

20 genes. For length 14, a preferred variegation is 

VSGGSIS<1X1X1>YYW<1>, where <1> is an equimolar mixture 
of the 19 native amino acid residues, except Cys (C) . 

The DNA that encodes these preferred HC CDRls is 
preferably synthesized using trinucleotide building blocks 
25 so that each amino acid residue is present in essentially 

equimolar or other described amounts. The preferred codons 
for the <1> amino acid residues are get, gat, gag, ttt, 
ggt, cat, att, aag, ctt, atg, aat, cct, cag, cgt, tct, act, 
gtt, tgg, and tat. Of course, other codons for the chosen 
30 amino acid residue could also be used. 

The diversity oligonucleotide (ON) is preferably 
synthesized from BspEI to BstXI (as shown in Table 1) and 
can, therefore, be incorporated either by PCR synthesis 
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using overlapping ONs or introduced by ligation of 
BspEI/EstXI-cut fragments. Table 2 shows the 
oligonucleotides that embody the specified variegations of 
the preferred length 5 HC CDRls of this invention. PCR 
5 using ON-RlVlvg, ON-Rltop, and ON-Rlbot gives a dsDNA 
product of 73 base pairs, cleavage with BspEI and BstXI 
trims 11 and 13 bases from the ends and provides cohesive 
ends that can be ligated to similarly cut vector having the 
3-23 domain shown in Table 1. Replacement of ON-RlVlvg 
M 10 with either 0NRlV2vg or ONRlV3vg (see Table 2) allows 

j"5 synthesis of the two alternative diversity patterns — the 

*U 7 residue length and the 14 residue length HC CDR1 . 

In 

t Q The more preferred libraries of this invention 



fU 



comprise the 3 preferred HC CDR1 length diversities. Most 
15 preferably, the 3 lengths should be incorporated in 

approximately the ratios in which they are observed in 
antibodies selected without reference to the length of the 
CDRs. For example, one sample of 10 95 HC genes have the 
three lengths present in the ratio: 
20 L=5:L=7:L=14: :820:175:23: :0. 80:0.17:0. 02. This is the 
preferred ratio in accordance with this invention. 



(ii) CDR2 

Diversity in HC CDR2 was designed with the same 
considerations as for HC CDR1: GLG sequences, mature 

25 sequences and 3D structure. A preferred length for CDR2 is 
17, as shown in Table 1. For this preferred 17 length 
CDR2, the preferred variegation in accordance with the 
invention is: <2>I<2><3>SGG<1>T<1>YADSVKG, where <2> 
indicates any amino acid residue selected from the group of 

30 Y, R, W, V, G and S (equimolar mixture) , <3> is P, S and G 
or P and S only (equimolar mixture) , and <1> is any native 
amino acid residue except C (equimolar mixture) . 
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ON-R2Vlvg shown in Table 3 embodies this 
diversity pattern. It is preferably synthesized so that 
fragments of dsDNA containing the BstXI and Xbal site can 
be generated by PCR. PGR with 0N-R2Vlvg, ON-R2top,and 
5 ONR2bot gives a dsDNA product of 122 base pairs. Cleavage 
with BstXI and Xbal removes about 10 bases from each end 
and produces cohesive ends that can be ligated to similarly 
cut vector that contains the 3-23 gene shown in Table 1. 

In an alternative embodiment for a 17 length HC 
10 CDR2 , the following variegation may be used: 

<1>I<4><1><1>G<5><1><1><1>YADSVKG, where <1> is as 
described above for the more preferred alternative of HC 
CDR2; <4> indicates an equimolar mixture of DINSWY, and <5> 
indicates an equimolar mixture of SGDN. This diversity 
15 pattern is embodied in ON-R2V2vg shown in Table 3. 

Preferably, the two embodiments are used in equimolar 
mixtures in the libraries of this invention. 

Other preferred HC CDR2s have lengths 16 and 19. 
H Length 16: <1>K4><1><1>G<5<1X1>YNPSLKG; Length 19: 

20 <l>K8>S<lxlxl>GGYY<l>YAASVKG, wherein <1> is an 

equimolar mixture of all native amino acid residues except 
C; <4> is a equimolar mixture of DINSWY; <5> is an 
equimolar mixture of SGDN; and <8> is 0.27 R and 0.027 of 
each of residues ADEFGHIKLMNPQSTVWY . Table 3 shows ON- 
25 R2V3vg which embodies a preferred CDR2 variegation of 

length 16 and ON-R2V4vg which embodies a preferred CDR2 
variegation of length 19. To prepare these variegations 
ON-R2V3vg may be PCR amplified with ON-R2top and ON-R2bo3 
and ON-R2V4vg may be PCR amplified with ON-R2top and 0N-R2- 
30 bo4 . See Table 3. In the most preferred embodiment of 
this invention, all three HC CDR2 lengths are used. 
Preferably, they are present in a ratio 
17: 16: 19: : 579:464:31: : 0.54: 0. 43: 0. 03. 



m 

•z'sg 



- 21 - 



(iii) CDR3 

The preferred libraries of this invention 
comprise several HC CDR3 components. Some of these will 
have only sequence diversity. Others will have sequence 
5 diversity with embedded D segments to extend the length, 
while also incorporating sequences known to allow Igs to 
fold. The HC CDR3 components of the preferred libraries of 
this invention and their diversities are depicted in 
Table 4: Components 1-8. 
H 10 This set of components was chosen after studying 

p the sequences of 1383 human HC sequences. The proposed 

components are meant to fulfill the following goals: 

\Q 1) approximately the same distribution of lengths 

W 

as seen in native Ab genes; 
s 15 2) high level of sequence diversity at places 

i£ 

;, 5 having high diversity in native Ab genes; and 

3) incorporation of constant sequences often seen 
in native Ab genes. 

Component 1 represents all the genes having 
20 lengths 0 to 8 (counting from the YYCAR motif at the end of 
FR3 to the WG dipeptide motif near the start of the J 
region, i.e., FR4). Component 2 corresponds the all the 
genes having lengths 9 or 10. Component 3 corresponds to 
the genes having lengths 11 or 12 plus half the genes 
25 having length 13. Component 4 corresponds to those having 
length 14 plus half those having length 13. Component 5 
corresponds to the genes having length 15 and half of those 
having length 16. Component 6 corresponds to genes of 
length 17 plus half of those with length 16. Component 7 
30 corresponds to those with length 18 . Component 8 

corresponds to those having length 19 and greater. See 
Table 4. 



in 
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For each HC CDR3 residue having the diversity 
<1>, equimolar ratios are preferably not used. Rather, the 
following ratios are used 0.095 [G and Y] and 0.048 [A, D , 
E, F, H, I, K, L, M, N, P, Q, R, S, T, V, and W] . Thus, 
5 there is a double dose of G and Y with the other residues 
being in equimolar ratios. For the other diversities, 
e.g., KR or SG, the residues are present in equimolar 
mixtures . 

In the preferred libraries of this invention the 
M-. 10 eight components are present in the following fractions: 1 

li (0.10), 2 (0.14), 3 (0.25), 4 (0.13), 5 (0.13), 6 (0.11), 7 

fU (0.04) and 8 (0.10). See Table 4. 

cn 

,n In the more preferred embodiment of this 

invention, the amounts of the eight components is adjusted 



15 because the first component is not complex enough to 

justify including it as 10% of the library. For example, 
if the final library were to have 1 x 10 9 members, then 
1 x 10 8 sequences would come from component 1, but it has 
only 2.6 x 10 5 CDR3 sequences so that each one would occur 

20 in -385 CDR1/2 contexts. Therefore, the more preferred 
amounts of the eight components are 1(0.02), 2(0.14), 
3(0.25), 4(0.14), 5(0.14), 6(0.12), 7(0.08), 8(0.11). In 
accordance with the more preferred embodiment component 1 
occurs in -77 CDR1/2 contexts and the other, longer CDR3s 

25 occur more often. 

Table 5 shows vgDNA that embodies each of the 
eight HC CDR3 components shown in Table 4. In Table 5, the 
oligonucleotides (ON) Ctop25, CtprmA, CBprmB, and CBot25 
allow PCR amplification of each of the variegated ONs 

30 (vgDNA) : Clt08, C2tl0, C3tl2, C4tl4, C5tl5, C6tl7, C7tl8, 
and C8tl9. After amplification, the dsDNA can be cleaved 
with Aflll and BstEII (or Kpnl) and ligated to similarly 
cleaved vector that contains the remainder of the 3-23 
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domain. Preferably, this vector already contains diversity 
in one, or both, of CDR1 and CDR2 as disclosed herein. 
Most preferably, it contains diversity in both the CDRl and 
CDR2 regions. It is, of course, to be understood that the 
5 various diversities can be incorporated into the vector in 
any order. 

Preferably, the recipient vector originally 
contains a stuffer in place of CDRl, CDR2 and CDR3 so that 
there will be no parental sequence that would then occur in 
10 the resulting library. Table 6 shows a version of the V3- 
23 gene segment with each CDR replaced by a short segment 
Itj that contains both stop codons and restriction sites that 

*g will allow specific cleavage of any vector that does not 

fU have the stuffer removed. The stuffer can either be short 

III 

15 and contain a restriction enzyme site that will not occur 



#1 



in the finished library, allowing removal of vectors that 



ssS: 
ffj 

12 are not cleaved by both A fill and BstEIl (or Kpnl) and 



religated. Alternatively, the stuffer could be 200-400 
bases long so that uncleaved or once cleaved vector can be 
20 readily separated from doubly cleaved vector. 

Human Antibody Light Chain: Sequence and Length Diversity 
(i) Kappa Chain 
(a) Framework 

In the preferred embodiment of this invention, 
25 the kappa light chain is built in an A27 framework with a 
JK1 region. These are the most common V and J regions in 
the native genes. Other frameworks, such as 012, L2, and 
All, and other J regions, such as JK4, however, may be used 
without departing from the scope of this invention. 
30 (b) CDRl 

In native human kappa chains, CDRls with lengths 
of 11, 12, 13, 16, and 17 were observed with length 11 
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being predominant and length 12 being well represented. 
Thus, in the preferred embodiments of this invention LC 
CDRls of length 11 and 12 are used in an and mixture 
similar to that observed in native antibodies), length 11 
5 being most preferred. Length 11 has the following sequence: 
RASQ<1>V<2><2X3>LA and Length 12 has the following 
sequence: RASQ<1>V<2><2><2><3>LA, wherein <1> is an 
equimolar mixture of all of the native amino acid residues, 
except C, <2> is 0.2 S and 0.044 of each of 

H 10 A DE FGH I KLMN P QRT VW Y , and <3> is 0 . 2 Y and 0.044 each of A, 

D, E, F, G, H, I, K, L, M, N, P, Q, R, T, V, W and Y. In 

lU the most preferred embodiment of this invention, both CDR1 

lengths are used. Preferably, they are present in a ratio 

HJ of 11:12: :154:73: :0. 68:0. 32. 

in 

r 15 (c) CDR2 

£ JE. 

IZ In native kappa, CDR2 exhibits only length 7. 

I* This length is used in the preferred embodiments of this 

% invention. It has the sequence <1>AS<2>R<4><1>, wherein <1> 

|« is an equimolar mixture of amino acid residues 

20 ADEFGHIKLMNPQRSTVWY; <2> is 0.2 S and 0.004 of each of 

ADE FGH I KLMN PQRTVW Y ; and <4> is 0.2 A and 0.044 of each of 
DE FGH I KLMN PQR S T U W Y . 

(d) CDR3 

In native kappa, CDR3 exhibits lengths of 1, 4, 
25 6, 7, 8, 9, 10, 11, 12, 13, and 19. While any of these 
lengths and mixtures of them can be employed in this 
invention, we prefer lengths 8, 9 and 10, length 9 being 
more preferred. For the preferred Length 9, the sequence is 
QQ<3X1><1X1>P<1>T, wherein <1> is an equimolar mixture of 
30 amino acid residues ADEFGHIKLMNPQRSTVWY and <3> is 0.2 Y 
and 0.044 each of ADEFGHIKLMNPQRSVW. Length 8 is 
preferably QQ33111P and Length 10 is preferably QQ3211PP1T, 
wherein 1 and 3 are as defined for Length 9 and 2 is S 
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to 



(0.2) and 0.044 each of ADEFGHIKLMNPQRTVWY . A mixture of 
all 3 lengths being most preferred (ratios as in native 
antibodies) , i.e., 8:9:10: : 28:166: 63 ::0. 1:0. 65:0. 25. 

Table 7 shows a kappa chain gene of this 
5 invention, including a PlacZ promoter, a ribosome-binding 
site, and signal sequence (M13 III signal) . The DNA 
sequence encodes the GLG amino acid sequence, but does not 
comprise the GLG DNA sequence. Restriction sites are 
designed to fall within each framework region so that 
jk 10 diversity can be cloned into the CDRs. Xmal and Espl are 

n in FR1, SexAI is in FR2, RsrII is in FR3, and Kpnl (or 

Acc65I) are in FR4 . Additional sites are provided in the 
constant kappa chain to facilitate construction of the 

:!f gene. 

Ill 

s 15 Table 7 also shows a suitable scheme of 

variegation for kappa. In CDRl, the most preferred length 
U 11 is depicted. However, most preferably both lengths 11 

!~ and 12 are used. Length 12 in CDRl can be construed by 

U introducing codon 51 as <2> (i.e. a Ser-biased mixture). 

20 CDR2 of kappa is always 7 codons. Table 7 shows a 

preferred variegation scheme for CDR2. Table 7 shows a 
variegation scheme for the most preferred CDR3 (length 9) . 
Similar variegations can be used for CDRs of length 8 and 
10. In the preferred embodiment of this invention, those 
25 three lengths (8, 9 and 10) are included in the libraries 
of this invention in the native ratios, as described above, 

Table 9 shows series of diversity 
oligonucleotides and primers that may be used to construct 
the kappa chain diversities depicted in Table 7. 
30 <ii) Lambda Chain 

(a) Framework 

The lambda chain is preferably built in a 2a2 
framework with an L2J region. These are the most common V 
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and J regions in the native genes. Other frameworks, such 
as 31, 4b, la and 6a, and other J regions, such as L1J, L3J 
and L7J, however, may be used without departing from the 
scope of this invention. 
5 (b) CDR1 

In native human lambda chains, CDRls with length 
14 predominate, lengths 11, 12 and 13 also occur. While 
any of these can be used in this invention, lengths 11 and 
14 are preferred. For length 11 the sequence is: 
|* 10 TG<2X4>L<4><4><4><3><4><4> and for Length 14 the sequence 

13 is: TG<1>SS<2>VG<1X3><2X3>VS, wherein <1> is 0.27 T, 0.27 

G and 0.027 each of ADE FH I KLMN PQRS VWY ; <2> is 0.27 D, 0.27 
N and 0.027 each of AEFGHIKLMPQRSTVWY; <3> is 0.36 Y and 
0.0355 each of ADE FGH I KLMN PQRS T VW ; and <4> is an equimolar 
15 mixture of amino acid residues ADEFGHIKLMNPQRSTVWY. Most 
preferably, mixtures (similar to those occurring in native 
H antibodies) preferably, the ratio is 11 : 14 : : 23 : 46 : : 0 . 33 : 

12 0.67 of the three lengths are used. 

H (c) CDR2 

20 In native human lambda chains, CDR2s with length 

7 are by far the most common. This length is preferred in 
this invention. The sequence of this Length 7 CDR2 is 
<4><4><4><2>RPS, wherein <2> is 0.27 D, 0.27 N, and 0.027 
each of AEFGHIKLMPQRSTVWY and <4> is an equimolar mixture 
25 of amino acid residues ADE FGH I KLMN PQRST VW . 
(d) CDR3 

In native human lambda chains, CDR3s of length 10 
and 11 predominate, while length 9 is also common. Any of 
these three lengths can be used in the invention. Length 
30 11 is preferred and mixtures of 10 and 11 more preferred. 

The sequence of Length 11 is <4><5><4><2><4>S<4><4><4X4>V, 
where <2> and <4> are as defined for the lambda CDR1 and 
<5> is 0.36 S and 0.0355 each of ADFFGHIKLMNPQRTVWY. The 



sequence of Length 10 is <5>SY<1><5>S<5><1><4>V, wherein 
<1> is an equimolar mixture of ADEFGHIKLMNPQRSTVWY ; and <4> 
and <5> are as defined for Length 11. The preferred 
mixtures of this invention comprise an equimolar mixture of 
Length 10 and Length 11. Table 8 shows a preferred focused 
lambda light chain diversity in accordance with this 
invention . 

Table 9 shows a series of diversity 
oligonucleotides and primers that may be used to construct 
the lambda chain diversities depicted in Table 7. 

Method of Construction of the Genetic Package 

The diversities of heavy chain and the kappa and 
lambda light chains are best constructed in separate 
vectors. First a synthetic gene is designed to embody each 
of the synthetic variable domains. The light chains are 
bounded by restriction sites for ApaLl (positioned at the 
very end of the signal sequence) and AscI (positioned afer 
the stop codon) . The heavy chain is bounded by Sfil 
(positioned within the PelB signal sequence) and NotI 
(positioned in the linker between CHI and the anchor 
protein) . Signal sequences other than PelB may also need, 
e.g., a M13 pill signal sequence. 

The initial genes are made with "stuffer" 
sequences in place of the desired CDRs . A "Stuffer" is a 
sequence that is to be cut away and replaced by diverse DNA 
but which does not allow expression of a functional 
antibody gene. For example, the stuffer may contain 
several stop codons and restriction sites that will not 
occur in the correct finished library vector. For example, 
in Table 10, the stuffer for CDR1 of kappa A27 contains a 
StuI site. The vgDNA for CDRl is introduced as a cassette 
from Espl, Xmal, or A fill to either SexAI or KasX. After 
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the ligation, the DNA is cleaved with Stul; there should be 
no Stul sites in the desired vectors. 

The sequences of the heavy chain gene with 
stuff ers is depicted in Table 6. The sequences of the 
5 kappa light chain gene with stuffers is depicted in Table 
10. The sequence of the lambda light chain gene with 
stuffers is depicted in Table 11. 

In another embodiment of the present intention 
the diversities of heavy chain and the kappa or lambda 
M= 10 light chains are constructed in a single vector or genetic 

J* packages (e.g., for display or display and expression) 

T : tar 

Jjj having appropriate restriction sites that allow cloning of 

■ % Q these chains. The processes to construct such vectors are 

!J[ well known and widely used in the art. Preferably, a heavy 

s 15 chain and Kappa light chain library and a heavy chain and 
I a * lambda light chain library would be prepared separately. 

|£ The two libraries, most preferably, will then be mixed in 

in 

;Z s equimolar amounts to attain maximum diversity. 

!•* Most preferably, the display is had on the 

20 surface of a derivative of M13 phage. The most preferred 
vector contains all the genes of M13, an antibiotic 
resistance gene, and the display cassette. The preferred 
vector is provided with restriction sites that allow 
introduction and excision of members of the diverse family 

25 of genes, as cassettes. The preferred vector is stable 
against rearrangement under the growth conditions used to 
amplify phage. 

In another embodiment of this invention, the 
diversity captured by the methods of the present invention 

30 may be displayed and/or expressed in a phagemid vector 

(e.g., pCESl) that displays and/or expresses the peptide, 
polypeptide or protein. Such vectors may also be used to 



store the diversity for subsequent display and/or 
expression using other vectors or phage. 

In another embodiment of this invention, the 
diversity captured by the methods of the present invention 
may be displayed and/or expressed in a yeast vector. 
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